World Map



Spread of Coronavirus SARS-CoV-2

World Map with Confirmed and Death Cases (Data source (JHU 2020b), see also (JHU 2020a))

Total Confirmed for each Country

Total Deaths for each Country

World Map / Inhabitants

Spread of Coronavirus SARS-CoV-2

World Map with Confirmed and Death Cases (Data source (JHU 2020b), see also (JHU 2020a))

Total Confirmed for each Country per 100,000 Inhabitants

Total Deaths for each Country per 100,000 Inhabitants

World Map / Hot Spots

Spread of Coronavirus SARS-CoV-2

World Map with mean Confirmed and Death Cases over the past seven days (Data source (JHU 2020b), see also (JHU 2020a))

Mean Confirmed Cases for each Country per 100,000 Inhabitants

Mean Death Cases for each Country per 100,000 Inhabitants

Bar Chart



Bar Charts with descending order

Column

Bar Chart Cumulated Confirmed

Bar Chart Cumulated Deaths

Column

Confirmed and Deaths - Cumulated and Daily Cases (absolute date)

Case numbers are taken from (JHU 2020b). In order to compensate for the daily fluctuations, the mean case numbers for the past seven days (*_Mean) were also added.

Bar Chart / Inhabitants



Bar Charts with descending order

Column

Bar Chart Cumulated Confirmed per 100,000 Inhabitants

Bar Chart Cumulated Deaths per 100,000 Inhabitants

Column

Confirmed and Deaths - Cumulated and Daily Cases per 100,000 Inhabitants (relative date)

Population numbers are taken from (UNO 2020). In order to compensate for the daily fluctuations, the mean case numbers for the past seven days (*_Mean) were also added.

Bar Chart / Hot Spots



Bar Charts with descending order

Column

Bar Chart Mean Confirmed per 100,000 Inhabitants over past seven days

Bar Chart Mean Deaths per 100,000 Inhabitants over past seven days

Column

Confirmed and Deaths - Cumulated and Daily Cases per 100,000 Inhabitants (relative date)

Population numbers are taken from (UNO 2020). In order to compensate for the daily fluctuations, the mean case numbers for the past seven days (*_Mean) were also added.

Bar Chart - CFR



Bar Chart with descending order

Column

Bar Chart CFR Total - Case Fatality Rate (in %)

Bar Chart CFR Total (in %) - - selected countries

Column

Case Fatality Rate - Proportion of deaths from confirmed cases

The number of confirmed cases is an early predictor of the number of deaths. The number of today’s deaths is already determined by the infections about by \(\sim19\) days ago or respectively by the confirmed cases about by \(\sim11\) days ago (see RWI 2020).

An average duration of Confirmed Infection to Death of \(12\) (lag-)days (country-independent for the sake of simplicity) is assumed for the calculations, since many tests are carried out in the meantime before symptoms appear.

However, this varies considerably depending on country-specific test rate and health system. In the worst health systems it is only one day for recognized cases, the “Confirmed” cases must be “lagged” by \(\sim1\) day. In the best case, the time from the end of incubation period (in average \(\sim5-6\) days) to death is an average \(\sim19\) days. In this case, the average Confirmed infection to Death period is \(\sim14\) days), the “Confirmed” cases must be correctly “lagged” by \(\sim14\) days. For the assumed time periods see (RKI 2020c), (RKI 2020b), for Case Fatality Rate and Incubation Period in general see (Wikipedia contributors 2020a), (Wikipedia contributors 2020c).

The simple calculation with unlagged cumulated confirmed cases divided by cumulated deaths results in a significant underestimation of the CFR in health systems with early disease detection. If the number of cumulative cases is already large compared to the number of active cases (~ cases from the past two weeks), the “lagged” rsp. “unlagged” values converge.

The Infection Fatality Rate (IFR) is the fatality rate of all infection, that means detected confirmed cases and undetected cases (asymptomatic and not tested group). This lethality is assumed to be country independent and only rough estimates exist (RKI: bottom of existing estimates \(\sim0.56\%\)).

Case Fatality Rate in % - CFR_total and CFR_past_period (period of past 14 days) w/ Confirmed lagged by 12 days

Case Fatality Rate (in %) - selected countries

Country CFR_total CFR_past_period CFR_unlagged
Austria 1.7 1.4 1.0
France 3.7 1.2 2.6
Germany 2.7 1.0 1.9
India 1.6 0.9 1.5
Italy 8.7 2.1 5.3
Japan 1.9 1.6 1.7
Spain 3.6 1.3 2.9
United Kingdom 5.9 1.3 4.4
United States of America 2.8 1.4 2.5
EU 3.9 1.5 2.6

Countries - Table overview

Countries - Table overview

Column

Confirmed and Deaths - Cumulated and mean daily Cases, overall and per 100,000 Inhabitants (absolute and relative date)

Population numbers are taken from (UNO 2020). In order to compensate for daily fluctuations, the mean number of cases for the past seven days (*_Mean) is used instead of the daily cases.

EU - Table overview

European Union - Table overview

Column

Confirmed and Deaths - Cumulated and mean daily Cases, overall and per 100,000 Inhabitants (absolute and relative date)

Population numbers are taken from (UNO 2020). In order to compensate for daily fluctuations, the mean number of cases for the past seven days (*_Mean) is used instead of the daily cases.

Cumulated and Daily Trend



Cumulated and Daily Cases over Time

Row

World

Row

Selected Countries

China

Austria

France

Germany

Italy

India

Japan

Spain

EU

United Kingdom

United States of America

Reproduction Number



Germany - Rolling Mean and Reproduction Number

Row

Germany Rolling Mean of Daily Cases

Rolling Mean of Daily Cases

The 7-day Rolling Mean/Moving Average of the Daily Confirmed and Death Cases smooths out the short-term weekly fluctuations (weekend).

The daily confirmed cases are related to the left y-axes, the daily death cases are related to the right y-axes. This clearly outlines the 12 days delay relation between daily confirmed and death cases and also the roughly the factor of ~1/25 (~4%).

Row

Germany Calculated Reproduction Number

Calculated Reproduction Number

The calculation of the reproduction number \(R(t)\) uses a R function provided by (Thomas Hotz 2020b) on GitHub.

The (effective) reproduction number \(R(t)\) at day \(t\), i.e. the average number of people someone infected at time \(t\) would infect if conditions remained the same.

For further German federal states figures (based on the data provided by Robert Koch Institut) see (Thomas Hotz 2020a) and for worldwide figures (based on the data provided by Johns Hopkins University) see (Thomas Hotz 2020c).

For the calculation the assumption of 7-days reporting delay (confirmed is reported 7-days after ‘real’ infection) is unchanged and the same modelled infectivity profile w is used. The lower and upper confidence interval lines provide the (approximate, pointwise) 95% confidence interval (only based on statistical numbers, possible changes in e.g. counting measures can not be considered).

This is the reason why (Thomas Hotz 2020b) “do not compute an average over a sliding window of seven days so the viewer immediately recognizes the size of such artefacts, warning her to be overly confident in the results. In fact, these artefacts are much larger than the statistical uncertainty due to the stochastic nature of the epidemic which is reflected in the confidence intervals.”

Nevertheless, here the calculation is based on the 7-days rolling mean and therefore the figure smooths over the the weekly rhythm.

Virus Spread on log10 scale



Column

Exponential Growth Evaluation

China and South Korea slowed down exponential growth significantly at an early stage. Their lines in the diagram with the log10 scale have therefore no longer had a significant slope.

In early phases countries have a more or less unchecked exponential growth. If countermeasures are effective, reduced exponential growth is reflected in a reduced slope of the accumulated cases again.

Column

Virus Spread with log10 scale (since mid of Jan)

Exponential Growth



Column

Estimation spread of the Coronavirus with Linear Regression of log data

Exponential Growth and Doubling Time \(T\)

Exponential growth over time can be fitted by linear regression if the logarithms of the case numbers is taken. Generally, exponential growth corresponds to linearly growth over time for the log (to any base) data (see also Wikipedia contributors 2020b).

The semi-logorithmic plot with base-10 log scale for the Y axis shows functions following an exponential law \(y(t) = y_0 * a^{t/\tau}\) as straight lines. The time constant \(\tau\) describes the time required for y to increase by one factor of \(a\).

If e.g. the confirmed or death cases are growing in \(t-days\) by a factor of \(10\) the doubling time \(T \widehat{=} \tau\) can be calculated with \(a \widehat{=} 2\) by

\(T[days] = \frac {t[days] * log_{10}(2)} {log_{10}(y(t))-log_{10}(y_0)}\)

with

\(log_{10}(y(t))-log_{10}(y_0) = = log_{10}(y(t))/y_0) = log_{10}(10*y_0/y_0) = 1\)

and doubling time

\(T[days] = t[days] * log_{10}(2) \approx t[days] * 0.30\).

For Spain, Italy, Germany we have had a doubling time of only \(T \approx 9days * 0.3 \approx 2.7 days\) at the beginning of the pandemic!!.

The doubling time \(T\) and the Forecast is calculated for following selected countries: Austria, France, Germany, Italy, India, Japan, Spain, EU, United Kingdom, United States of America and World in total (see Forecast / Doubling Time).

Germany - Trend with Forecast on a linear scale

Forecast Plot - next 14 days

The plot shows the daily cases forecast increase in case of unchecked exponential growth. The dark shaded regions show the 80% rsp. 95% prediction intervals. These prediction intervals are displaying the uncertainty in forecasts based on the linear regression of the logarithmic data over the past 21 days.

Column

Comparison Exponential Growth (Daily rolling mean cases on log10 scale)

Germany - Cumulated cases with ~linear slope on a log10 scale

Compare Exp vs Linear Growth



Column

Comparison Exponential vs. Linear Growth

The charts compare the different forecasts for an exponential rsp. linear growth model. Due to the large fluctuations of the daily cases regression of three weeks is required. Otherwise the prediction levels are much too big.

The dark shaded regions are indicating the \(80\%\) rsp. \(95\%\) prediction intervals. These prediction intervals are displaying the “pure” statistical uncertainty in forecasts based on the regression models.

For doubling periods in the order of period of infectivity (RKI assumption: \(\sim9-10\) days, with great uncertainty, see (RKI 2020b), we no longer have exponential growth. The “old” infected cases are at the end of the doubling period no longer infectious (active). This results in a constant infection rate with basic reproduction number \(R_t \sim 1\) or even \(<1\).

Note: for case numbers of German federal states see (RKI 2020a).

Column

Daily Rolling Mean Cases - Comparison Exponential and Linear Growth

Cumulated Cases - Comparison Exponential and Linear Growth

Doubling Time / Forecast Accuracy



Column

Doubling Time and Forecast The forecasted cases for the next 14 days are calculated ‘only’ from the linear regression of the logarithmic data and are not considering any effects of measures in place. In addition data inaccuracies are not taken into account, especially relevant for the confirmed cases.

Therefore the 14 days forecast is only an indication for the direction of an unchecked exponentiell growth.

The cumulated cases doubling rate is only a good indicator at the beginning of the pandemic. If the number of confirmed cases from the past two weeks is already small compared to the total number of confirmed cases, the number of infectious people is also small compared to the total cases.

Therefore the table below provides the doubling time for the daily rolling mean cases. The forecast is based on the linear regression of the logarithmic data of past 21 days.

Forecast (FC) Daily Rolling Mean Cases: Doubling Time (days), Forecasted cases next day/tomorrow and Forecasted cases in 14 days
Country Case_Type T_doubling Reg_last_day FC_next_day FC_14days
Austria Confirmed 9.2 4’226 4’556 12’117
EU Confirmed 12.5 210’660 222’659 457’487
France Confirmed 13.5 48’585 51’138 99’509
Germany Confirmed 10.3 16’951 18’127 43’372
India Confirmed -30.3 43’448 42’465 31’534
Italy Confirmed 8.0 30’177 32’900 101’148
Japan Confirmed 45.0 674 684 836
Spain Confirmed 17.0 22’111 23’032 39’149
United Kingdom Confirmed 30.5 24’471 25’034 33’644
United States of America Confirmed 27.2 83’599 85’754 119’380
World Confirmed 30.9 511’757 523’378 700’785
Austria Deaths 8.8 22 24 68
EU Deaths 10.6 1’647 1’758 4’122
France Deaths 9.1 361 390 1’052
Germany Deaths 10.3 68 72 174
India Deaths -23.9 500 486 333
Italy Deaths 6.8 248 274 1’026
Japan Deaths 24.1 8 9 13
Spain Deaths 33.3 171 175 229
United Kingdom Deaths 11.9 282 299 638
United States of America Deaths 73.6 839 847 957
World Deaths 51.1 6’568 6’657 7’940

Column

Forecast of Daily and Cumulated Cases and check of Accuracy

The forecast accuracy is checked by using the forecast method for the past three weeks before the past week (training data). Subsequent forecasting of the past week enables comparison with the real data of these days (test data).

The comparison is also an early indicator if the exponential growth is declining. However, possible changes in underreporting (in particular the proportion confirmed / actually infected) requires careful interpretation.

For doubling periods of the total cumulated cases in the order of infectivity (RKI assumption: \(\sim9-10\) days, with great uncertainty, (see RKI 2020b), we have no
exponential growth for the total cumulated cases. Since the “old” infected cases are no longer infectious after these periods and we then have a constant infection rate with basic reproduction number \(R_t \sim 1\).

Instead, we have “only” linear growth of the cumulative Confirmed Cases and the Daily Confirmed Cases remain more or less constant if \(R_t \sim 1\).

However, the basic reproduction number \(R_0 (\approx 3.3 - 3.8)\) (RKI 2020d) is a product of the average number of contacts of an infectious person per day, the probability of transmission upon contacts and the average number of days infected people are infectious. With the current uncertainty of the average duration of the infectivity duration, \(R_0\) can therefore be estimated from the doubling time only to a very limited extent. See also (CMMID 2020).

Germany - Forecast Daily Rolling Mean Cases based on past 21 days

Germany - Forecast Accuracy Check Daily Rolling Mean Cases for the past 7 days

Forecast w/ CFR



Column

Forecasting with lagged Predictors

The number of confirmed cases is an early predictor of the number of deaths. The number of today’s deaths is already determined by the infections about by \(\sim19\) days ago or respectively by the confirmed cases about by \(\sim11\) days ago see Bar Chart - CFR.

The country specific case fatality rate (CFR, proportion of deaths from confirmed cases) indicates the country specific testing rate and may depend on quality/capacity of hospitals.

Overall a rough conclusion on the country specific underreporting rate (lack of diagnostic confirmation; proportion of all infected to confirmed cases) is feasible if the infection fatality rate (IFR, confiremd cases plus all asymptomatic and undiagnosed infections) is assumed to be country independent and the IFR is known (bottom of existing estimates \(\sim0.56\%\), assumption by RKI see (RKI 2020b).

In this case an estimation of the CFR of \(0.06\) \((6\%)\) indicates an underreporting by a by a factor of \(\sim10\). A CFR of \(0.20\) \((20\%)\) indicates an underreporting by a by a factor of \(\sim30\). This corresponds to RKI assumption of a underreporting by a factor of \(11-20\) (RKI 2020c). Unfortunately, the IFR or lethality is still far too imprecise for concrete conlusions.

In the model paper RKI assumes for the

  • Incubation period \(\sim5-6\) days - Day of infection day until symptoms are upcoming)
  • Hospitalisation \(+4\) days - Admission to the hospital (if needed) after Incubation Period)
  • Average period to death \(+11\) - if the patient dies, it takes an average of \(11\) days after admission to the hospital

Depending on the country-specific test frequency (late or early tests), the

*lag_days - time from receipt of the confirmed test result to death, Confirmed to Death, is about \(11-13\) days.

Note: these methods are also used for example for advertising campaigns. The campaign impact on sales will be some time beyond the end of the campaign, and sales in one month will depend on the advertising expenditure in each of the past few months (see Hyndman and Athanasopoulos 2020).

Column

Daily Confirmed and Death Cases

Lag days and Case Fatality Rate (CFR)

Case Fatality Rate in % (CFR, proportion of deaths from confirmed cases) lagged by 12 days; total and for period of past 14 days
Country CFR_total CFR_past_period CFR_unlagged
Germany 2.7 1.0 1.9
Italy 8.7 2.1 5.3
Spain 3.6 1.3 2.9

Column

Daily Deaths Forecast depending on lagged Daily Confirmed Cases (ARIMA model)

Exampla Germany - White Noise of Forecast Residuals

Forecast residuals
indicate quality of
Arima model fit:

References



Data Source

Data Source

Data files are provided by Johns Hopkins University on GitHub
https://github.com/CSSEGISandData/COVID-19/tree/master/csse_covid_19_data/csse_covid_19_time_series

  • Data files:
    • time_series_covid19_confirmed_global.csv
    • time_series_covid19_deaths_global
    • time_series_covid19_recovered_global.csv

The data are visualized on their excellent Dashboard
Johns Hopkins University Dashboard
https://coronavirus.jhu.edu/map.html

Code Source

Directory with all R Sources is replicated in GitHub repository:
https://github.com/WoVollmer/R-TimesSeriesAnalysis/tree/master/Corona-Virus

Code is based on ideas from https://rpubs.com/TimoBoll/583802

Bibliography

CMMID. 2020. “Temporal Variation in Transmission During the Covid-19 Outbreak.” GitHub. https://cmmid.github.io/topics/covid19/current-patterns-transmission/global-time-varying-transmission.

Hyndman, Rob J, and George Athanasopoulos. 2020. “Forecasting: Principles and Practice.” Online textbook. https://otexts.com/fpp3/lagged-predictors.html.

JHU. 2020a. “Coronavirus - Dashboard.” Internet. https://coronavirus.jhu.edu/map.html.

———. 2020b. “Coronavirus - Time Series Data.” GitHub. https://github.com/CSSEGISandData/COVID-19/tree/master/csse_covid_19_data/csse_covid_19_time_series.

RKI. 2020a. “Coronavirus - Fallzahlen.” Germany: Internet. https://www.rki.de/DE/Content/InfAZ/N/Neuartiges_Coronavirus/Fallzahlen.html?nn=13490888.

———. 2020b. “Coronavirus - Modellierung.” Germany: Internet. https://www.rki.de/DE/Content/InfAZ/N/Neuartiges_Coronavirus/Modellierung_Deutschland.html.

———. 2020c. “Coronavirus - Steckbrief.” Germany: Internet. https://www.rki.de/DE/Content/InfAZ/N/Neuartiges_Coronavirus/Steckbrief.html.

———. 2020d. “Schätzung Der Aktuellen Entwicklung Der Sars-Cov-2-Epidemie in Deutschland - Nowcasting.” Germany: Internet. https://www.rki.de/DE/Content/Infekt/EpidBull/Archiv/2020/17/Art_02.html?nn=13490888.

RWI. 2020. “Corona-Pandemie: Statistische Konzepte Und Ihre Grenzen.” Germany: Internet. http://www.rwi-essen.de/unstatistik/101/.

Thomas Hotz, Stefan Heyder, Matthias Glock. 2020a. “Monitoring Der Ausbreitung von Covid-19 Durch Schätzen Der Reproduktionszahl Im Verlauf Der Zeit Konzepte Und Ihre Grenzen; Deutschland.” Germany: Internet. https://stochastik-tu-ilmenau.github.io/COVID-19/germany.

———. 2020b. “Monitoring the Spread of Covid-19 by Estimating Reproduction Numbers over Time; Technical Report.” Germany: Internet. https://stochastik-tu-ilmenau.github.io/COVID-19/reports/repronum/repronum.pdf.

———. 2020c. “Monitoring the Spread of Covid-19 by Estimating Reproduction Numbers over Time; World & Selected Countries.” Germany: Internet. https://stochastik-tu-ilmenau.github.io/COVID-19/index.html.

UNO. 2020. “World Population Prospects 2019.” Internet. https://population.un.org/wpp/Download/Standard/Population/.

Wikipedia contributors. 2020a. “Case Fatality Rate — Wikipedia, the Free Encyclopedia.” https://en.wikipedia.org/w/index.php?title=Case_fatality_rate.

———. 2020b. “Exponential Growth — Wikipedia, the Free Encyclopedia.” https://en.wikipedia.org/w/index.php?title=Exponential_growth.

———. 2020c. “Incubation Period — Wikipedia, the Free Encyclopedia.” https://en.wikipedia.org/w/index.php?title=Incubation_period.